Speech enhancement using 2-D Fourier transform
نویسندگان
چکیده
This paper presents an innovative way of using the two-dimensional (2-D) Fourier transform for speech enhancement. The blocking and windowing of the speech data for the 2-D Fourier transform are explained in detail. Several techniques of filtering in the 2-D Fourier transform domain are also proposed. They include magnitude spectral subtraction, 2-D Wiener filtering as well as a hybrid filter which effectively combines the one-dimensional (1-D) Wiener filter with the 2-D Wiener filter. The proposed hybrid filter compares favorably against other techniques using an objective test.
منابع مشابه
2-D Speech Enhancement based on Curvelet Transform using Different Window Functions
In this paper, an improved method based on Curvelet Transform using different window functions is presented for the speech enhancement. The window function is used for preprocessing of speech signals. In this method, instead of using two-dimensional (2-D) discrete Fourier Transform, Curvelet transform is employed with spectral magnitude subtraction method. General terms Spectral Substraction Me...
متن کاملSTFT-based speech enhancement by reconstructing the harmonics
A novel Short Time Fourier Transform (STFT) based speech enhancement method is introduced. This method enhances the magnitude spectrum of a noisy speech segment. The new idea that is used in this method is to basically reconstruct the harmonics at the multiples of the fundamental frequency ( 0 F ) rather than trying to improve them. The harmonics are produced, in the magnitude spectrum, using t...
متن کاملStudies on Wavelet Based Linear Prediction Coefficients for Natural Sounding Speech using Different Windowing Techniques
This paper is aimed at to study LPC for Natural speech with different windowing techniques the effect of window shape on improving the speech quality by reducing the noise with the help of fixed and variable Windows with optimum shape. In the speech process signal corrupted by noise is segmented into frames and each segment is windowed using different Window with variation in the shape paramete...
متن کاملSpeech Enhancement Using Beta-order Mmse Spectral Amplitude Estimator with Laplacian Prior
This report addresses the problem of speech enhancement employing the Minimum Mean-Square Error (MMSE) of β-order Short Time Spectral Amplitude (STSA). We present an analytical solution for β-order MMSE estimator where Discrete Fourier Transform (DFT) coefficients of (clean) speech are modeled by Laplacian distributions. Using some approximations for the joint probability density function and t...
متن کاملSpeech Enhancement in the Dft Domain Using Laplacian Speech Priors
In this paper we consider optimal estimators for speech enhancement in the Discrete Fourier Transform (DFT) domain. We derive an analytical solution for estimating complex DFT coefficients in the MMSE sense when the clean speech DFT coefficients are Laplacian distributed and the DFT coefficients of the noise are Gaussian or Laplacian distributed. We show that these estimators have a number of i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IEEE Trans. Speech and Audio Processing
دوره 11 شماره
صفحات -
تاریخ انتشار 2003